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ABSTRACT 

We revisit the main H i-to-stellar mass ratio (gas fraction) scaling relations, taking 
advantage of the HI spectral stacking technique to understand the dependence of 
gas content on the structural and star formation properties of nearby galaxies. This 
work uses a volume-limited, multi-wavelength sample of ^25,000 galaxies, selected 
according to stellar mass (10 9 M 0 ^ M* ^ 10 115 M 0 ) and redshift (0.02 < 2 < 0.05) 
from the Sloan Digital Sky Survey, and with Hi data from the Arecibo Legacy Fast 
ALFA survey. We bin according to multiple parameters of galaxies spanning the full 
gas-poor to -rich regime in order to disentangle the dominance of different components 
and processes in influencing gas content. For the first time, we show that the scaling 
relations of gas fraction with stellar mass and stellar surface density are primarily 
driven by a combination of the underlying galaxy bimodality in specific star formation 
rate and the integrated Kennicutt-Schmidt law. Finally, we produce tentative evidence 
that the timescales of HI depletion are dependent upon galaxy mass and structure, at 
fixed specific star formation rate. 

Key words: galaxies: evolution - galaxies: fundamental properties - galaxies: pho¬ 
tometry - galaxies: ISM radio lines: galaxies 


1 INTRODUCTION 

Interstellar neutral atomic hydrogen (H i) is the principle 
gaseous component of galaxies and supplies the raw fuel 
for star formation. As a result, understanding the proper¬ 
ties of HI is critical if we are to form a complete picture of 
galaxy evolution. There are strong correlations and complex 
relationships between galaxy gas content and the proper¬ 
ties of colour, shape, size and star formation (see reviews 
by [Kenni cutt|| 1998| |Boselli fc Gavaz~zl||2006| and references 
therein). While it is known that these parameters are reg¬ 
ulated by the processes that govern galaxy evolution and 
the timescales over which they occur, the extent of the in¬ 
terplay between global galaxy properties and the cold gas 
component remains unclear. 

Until recently, large area comprehensive surveys of ex- 
tragalactic Hi were non-existent. This meant that studies 
of cold gas lagged far behind those of stellar populations in 
both statistics and detail. However, over the past decade, 21- 
cm line observations have dramatically increased the quan¬ 
tity and quality of data available, characterising the neutral 
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hydrogen content for thousands of galaxies. The Hi Parkes 
All-Sky Survey (HIPASS; Meyer et al. 2004) contains ~5,000 
Hl-selected galaxies detected in emission out to ~180 Mpc 
and was the first blind HI survey to provide such a sample. 
Its successor, the Arecibo Legacy Fast ALFA (ALFALFA; 
Giovanelli et al.[ 2005) survey is measuring HI emission over 
7000 deg 2 of sky out to a distance of ~250 Mpc, reaching 
nearly eight times better sensitivity, four times better an¬ 
gular resolution and three times the spectral resolution of 
HIPASS. The latest data release of ALFALFA covers forty 
per cent of the total survey area and contains 15,855 indi¬ 


vidually detected sources (a.40 H I Source Catalog; Haynes 
et al. 2011). In addition to blind surveys, deep observa¬ 


tions have measured global and resolved HI properties for a 


smaller yet growing number of galaxies (e.g. van der Hulst 


et al. 2001 Walter et al. 20081. The largest targeted cen¬ 


sus of HI for a representative sample to date is the GALEX 
Arecibo SDSS Survey (GASS; Catinella et al. 2010), provid¬ 
ing HI quantities for a stellar mass selected sample of ~800 
nearby galaxies with stellar masses greater than 10 10 Mg. 


At other wavelengths, large scale spectroscopic and pho¬ 
tometric surveys such as the Galaxy Evolution Explorer 


(GALEX; Martin et al. 2005 Morrissey et al. 20071 sur 
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veys in the UV, the Sloan Digital Sky Survey (SDSS; |York 
et al.||2000|l in the optical, the Two-Micron All Sky Survey 


(2MASS; Skrutskie et al.|2006 1 in the near-infrared and the 


Wide-held Infrared Survey Explorer (WISE; Wright et al. 


2010) cover a significant fraction of the sky and provide 
a plethora of multi-wavelength data for 10° — 10 6 objects. 
Combining such survey efforts across the entire breadth of 
the electromagnetic spectrum allows for an increasingly in- 
depth and statistical approach to the study of galaxy popu¬ 
lations and their properties. 

Sensitivity limitations of current radio telescopes mean 
that direct detection of Hi emission for large numbers of 
galaxies is limited to the local Universe (|Catinclla fc Cortesej 


20151. Deep targeted observations probe lower gas content 


effectively but are exceptionally time expensive and as such 
are unable to provide samples capable of statistically match¬ 
ing the scope and scale of the UV, optical and IR surveys. 
On the other hand, blind Hl-surveys cover huge volumes 
and large numbers of galaxies but are not sensitive enough 
to yield detections for the lower end of the HI mass function 
beyond the very closest systems. 

However, we may exploit H i-blind surveys beyond their 
nominal sensitivity limits. Taking advantage of the spectral 
stacking technique, we compute average H i-to-stellar mass 
ratios (gas fractions) from high quality 21-cm line spectra 
of galaxies that are not necessarily individually detected in 
Hi, yet selected using optical coordinates and redshifts. Ul¬ 
timately, until next generation facilities such as the Square 
Kilometre Array (SKA) survey vast areas of sky with un¬ 
precedented speed and sensitivity, the stacking of HI spec¬ 
tra for an ensemble of detected and undetected galaxies is 
the best (and only) way to accurately quantify the gas frac¬ 
tion for such large, representative samples of galaxies. The 
method has been previously employed with great success 


2011 


to probe lower gas content in both emission (Fabello et al. 
2012 Gereb et al.||2013l and absorption (Gereb et al. 


20141, and to estimate the HI cosmic density at intermediate 
redshift (e.g. Lah et al.|2007 Rhee et al.|2013 1. 


There have been many studies into the relationships 
of gas content with star formation and galaxy structure for 


thousands of systems. Fabello et al. (2011) applied HI stack¬ 


ing to a large multiwavelength dataset, quantifying the main 
scaling relations of gas fraction with stellar mass, stellar sur¬ 
face density and colour for ~5,000 massive galaxies. Works 
such as Kannappan (20041, Cortese et al. (201 1|) , Oh et al. 


(|2011[) and |Catinella et al.| (2013[) have looked at the same 


relations using targeted, deep observations. All these inves¬ 
tigations showed a strong negative trend of gas content with 
stellar mass, stellar surface density and colour, identifying 
stellar surface density and colour, respective morphological 
and star formation indicators, as the two properties most 
tightly correlated with gas fraction. 

Such studies highlight the role of internal structure and 
star formation in the regulation of atomic gas content. Un¬ 
derstanding the extent and causality of this relationship is 
of vital importance and has been the focus of much recent 
work (e.g. Fabello et al.|20lT Cortese et al.|20Tl Catinella 


|et al.|[2013| ). However, the processes are not as yet fully 

understood and studies have frequently arrived at differing 
conclusions, either downplaying the importance of a bulge 
or bar in affecting gas content or, contrastingly, suggesting 
such structures may have an influence upon gas consump¬ 


tion (e.g. Saintonge et al. 2012 Leroy et al. 2013 Huang 
et al.|20l4 1 . 


In this paper we use stacking analysis with unprece¬ 
dented statistics to build comprehensive scaling relations 
with gas content, investigating the separate influences of 
mass, structure and star formation on cold gas for the entire 
gas-poor to -rich regime. 

The paper is organised as follows. Section [2] describes 
the sample selection criteria and multi-wavelength data used 
in this work. A brief overview of the Hi spectral stacking 
technique and the errors involved is given in Section [3] The 
results, scaling relations and analysis are presented in Sec- 
tion[4]and, finally, we compare our work with previous stud¬ 
ies and discuss implications in the context of galaxy evolu¬ 
tion in Section [5] 

Throughout this paper the distance dependent quan¬ 
tities are computed assuming a ACDM cosmology with 
Q = 0.3, A = 0.7, and a Hubble constant Ho = 
70 km s -1 Mpc -1 . 


2 THE SAMPLE 
2.1 Selection Criteria 

We define a volume-limited, stellar mass selected parent 
sample of 30,695 galaxies from the intersection of the Sloan 


Digital Sky Survey Data Release 7 (SDSS DR7; Abazajian 


et al. 2009) footprint and sky region for which ALFALFA 


data have been processed to date. 

The stellar mass, M*, and volume cuts applied are as 
follows: 


10 9 M 0 ^ M* ^ 10 11 ' 5 M 0 
112.5° ^ a «: 247.5° 

0° ^ <5 s: 18° & 24° S s; 30° 
0.02 s; 2 ^ 0.05 


Stellar mass estimates, derived via photometric fitting, are 
taken from the MPA-JHU SDSS DR70 catalogue. The sam¬ 
ple straddles the ‘transiti on mass’ in galaxy popu lations of 
3 x 10 10 M 0 identified by Kauffmann et ahj ( ] 2003 [ ) . 

The significant contribution of peculiar velocities to 
galaxy redshift measurements is avoided by setting a lower 
redshift limit of 2 ^ 0.02 while the frequency range over 
which the San Juan airport radar affects Arecibo’s observa¬ 
tion of redshifted HI emission is removed by the ceiling of z 
sj 0.05. 


The sample also lies in the sky footprint of the GALEX 
surveys. Therefore, for this work, the sample is further re¬ 
duced in size because we include only those galaxies de¬ 
tected in near-ultraviolet (NUV, see Section [2.2| for details). 
Finally, we discard an additional one per cent of the sam¬ 
ple due to radio frequency interference (RFI) contamination 
of ALFALFA Hi spectra. A full description of the SDSS 
optical, GALEX NUV and ALFALFA H i data is given in 
sections 12.21 and 12.31 


1 http://www.mpa-garching.mpg.de/SDSS/DR7/ 

In this work we use the improved stellar masses from: 
http: //home, strw.leidenuniv. nl/r^jarle/SDSS/ 
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Figure 1. Distribution of sample galaxies across stellar mass, redshift, stellar surface density and NUV-r colour. The black histograms 
show the parent sample; the blue, dashed histograms represent the final sample of 24,337 galaxies used in this paper. 


The total sample of SDSS DRY galaxies with high qual¬ 
ity (i.e. uncontaminated by RFI) ALFALFA spectra and 
GALEX NUV photometry is 24,337 galaxies. 


2.2 Optical and UV Data 


Galaxy properties including redshift, size and magnitudes 
are obtained from the SDSS DR7 database server CasJob .0 
via Structured Query Language queries. Optical photomet¬ 
ric data are taken from five broad bands: u, g, r, i, z (AB 


system; Fukugita et al. 19961. For magnitudes and colours 


we take the ‘model’ magnitudes. These are the optimal fit 
of either a pure de Vaucouleurs or a pure exponential pro¬ 
file to the galaxy flux in each band. We compute distance 
measurements and rest-frame shift Hi spectra using fibre 
spectroscopic redshifts. 


The UV data are obtained by carefully cross- matching 
the GALEX Unique Source Catalogue^] (GCAT; Seibert 


et al. 20121 and the Bianchi, Conti, Shiao Catalo gue of 
Unique GALEX Source]^] (BCS; [Bianchi et al.|[2014 i to the 
SDSS database using an impact parameter of ten arcsec- 
onds. The two UV catalogues use very similar techniques 
and as such the difference between their photometries is 
small (^0.1 mag). Where galaxies are present in both cat¬ 
alogues we selected magnitudes from the GCAT as this has 
a larger overlap with our sample. GCAT does not include 
the GALEX GR7 data release and so, when necessary, we 
also draw values from the BCS catalogue. We use magni¬ 
tudes obtained from the Medium Imaging Survey (MIS; see 
Martin et al. [2005 ) images for the ninety-six per cent of the 


2 http://skyserver.sdss3.org/casjobs/ 


3 http://archive.stsci.edu/prepds/gcat/ 

4 http://archive.stsci.edu/prepds/bcscat/ 
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Figure 2. NUV-r colour versus stellar mass diagram for galaxies 
in our sample. Contours represent density levels from a minimum 
of 20 galaxies per bin up to 250 per bin. The points show positions 
in the plane for the full sample of 24,337 galaxies. Black points 
are the 4,610 galaxies detected in Hi by ALFALFA, grey points 
are undetected galaxies in the same HI observations. 


sample where it is available, and from the All-Sky Imaging 
Survey (AIS) for the remaining four per cent. 

All optical photometric data are corrected for Galac¬ 
tic extinction following |SchlegeI, Fin kbeiner fe Davis| (1 998[ ) 
and using the respective band extinction obtained from 
the SDSS DR7 PhotObj catalogue. Reddening correction 
for NUV photometry is according to Wyder et al. ( |2007[ ), 
who adopt A(A)/E(B-V) = 2.751 for SDSS r-band and 
A(A)/E(B-V) = 8.2 for GALEX NUV. Thus we convert to 
NUV extinction, Anuv, using Anuv— A r = 1.9807A r , where 
A r is the r-band extinction. 

Figure [I] shows the distributions of stellar mass, red- 
shift, NUV-r colour and stellar mass surface density for the 
galaxies in our sample. We derive the stellar surface density, 
n* , as: 




M* 

2^Rlo, z 


(1) 


where Rso,z is the Petrosian radius containing fifty per cent 
of the flux in z band, expressed in kpc. 

The distributions of the main optical- and UV-derived 
parameters for the parent sample (solid black line) and of 
the galaxies used in this work (hatched blue histograms) are 
given in Figure [T| The ratio between sample galaxies with 
GALEX data used in this work and their parent sample 
is similarly uniform in distribution across the four proper¬ 
ties. This illustrates that, when removing galaxies for which 
there are no valid NUV magnitudes, we do not inadvertently 
introduce bias by over or under sampling a region of the pa¬ 
rameter space. 

Figure [Id| is the NUV-r colour histogram for the sample 
and exhibits the well known blue cloud and red sequence bi¬ 
modality ( |Wyder et al.|2007 Schiminovich et al.|2007 l. The 
NUV-r colour is chosen as it probes young and old stellar 
populations on either side of the 4000A break and thus is 
sensitive to the star formation properties of a galaxy. 


2.3 Hi 21-cm Line Data 


The 21-cm Hi spectra are extracted from the full volume 
of ALFALFA data cubes using positional coordinates, a, 5 
and redshift, z, drawn from the SDSS DR7 database. The 
technique of optical selection from data cubes within the 
shared footprint of SDSS and ALFALFA provides spectra 
with excellent baseline characteristics for both detections 
and non-detections in Hi, each with optical counterpart in¬ 
formation. For examples of Hi detected and non-detected 
ALFALFA spectra as well as a full description of the extrac¬ 


tion procedure see Section 3 of Fabello et al. (2011). 


We extract spectra with a raw spectral resolution of 
~5.5 km s -1 and spanning ~5,500 km s _1 in velocity range 
(25 MHz in frequency) from a 4 x 4 arcminute aperture 
centred on the position of the target galaxy. The quality 
of each radio spectrum has been conservatively assessed by 
an automated routine which flagged spectra with spikes in 
the signal (more than ten times the root-mean-square, rms, 
value) within a central 1000 km s^ 1 interval centred on the 
redshift of the galaxy. We conducted a visual inspection of 
all spectra that failed this test and discarded those with 
central RFI. 

Figure [2] shows the NUV-r colour-mass diagram for 
our sample with individual ALFALFA detections and non¬ 
detections shown by black and grey points respectively. The 
superimposed galaxy number density contours show the 
classic bimodality, separating the blue, star forming from 
the red, quiescent galaxies. As can be seen, blind surveys 
such as ALFALFA preferentially detect blue cloud objects 
(NUV-r ^ 3.5) and though red sequence detections do oc¬ 
cur, they are rare, a result already noted (e.g. Huang et al. 
2012). 


Galaxies in our sample have Petrosian radii contain¬ 
ing ninety per cent of the r-band flux (Rgo.r) of less than 
one arcminute, hence their HI emission is always unresolved 
(Arecibo beam FWHM = ~3.5 arcminutes). 

The ALFALFA Hi detection (S/N Js 6.5) rate for our 
sample is nineteen per cent (4,610 galaxies). Reliable indi¬ 
vidual Hi flux densities for detected galaxies not already 
included in the a.40 ALFALFA data release were provided 
by M.P. Haynes. 

H i masses are computed via the standard formula: 

Mm _ ( 2.356 x 10 5 \ / D^) ) 2 ( S 2 i \ ( ) 

M 0 \ 1 + z ) \ Mpc ) \ Jy km s -1 ) 

where 2 is the redshift, D L ( z ) is the luminosity distance and 
S 21 is the integrated 21-cm line flux density. Gas fraction is 
simply taken as Mhi/M*. 


3 H 1 SPECTRAL STACKING 


With over eighty per cent Hi non-detections, the sample 
used in this work is ideally designed to exploit the capabili¬ 
ties of the stacking technique, allowing us to determine how 
H 1 content varies with galaxy properties. We use an adapted 


version of the software developed by Fabello et al. (2011). 


A more comprehensive description of the stacking technique 
can be found in Section 3.2 of that work. 

Stacking makes use of the fact that contained within 
the three dimensional volume of 21-cm data cubes is the 
Hi emission from galaxies regardless of whether they are 
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formal detections. Using the optical position and redshift, 
we extract 21-cm line spectra from the cubes at the locations 
of our sample galaxies. 

Each spectrum is shifted to its rest-frame velocity, so 
that the galaxy is centred at zero velocity, using the SDSS 
spectroscopic redshift for the corresponding optical coun¬ 
terpart. The mean values of redshift and stellar mass within 
each bin may not be representative of the individual galax¬ 
ies, therefore we weight each spectrum by its redshift and 
stellar mass before stacking in order to transform the radio 


signal into a ‘gas fraction spectrum’ (see eqn. 5 in Fabello 


et al. 2011). Fabello et al. (2011) find this method is con¬ 


sistent with - and when the bins are large preferable to - 
co-adding the raw spectra and using the mean redshift to 
convert the stacked signal into an average HI mass and sub¬ 
sequent gas fraction. The rest-frame spectra for N galax¬ 
ies are then weighted by their rms noise, and co-added, or 
stacked, yielding a total signal for the sample while reduc¬ 
ing the rms as l/\/N. Once the number of spectra stacked 
reaches the threshold N ~300 non-Gaussian noise begins to 
dominate, therefore the reduction in rms continues but at a 
decreased rate (Fabello et al. 2011). 

In some instances (e.g. where statistics in the bins is 
low or galaxies have red NUV-r colours) the stacked signal 
yields a non-detection. When this occurs upper limits are 
computed assuming a 5<r signal with a velocity width of 
W = 200 km s -1 for bins with (M*) ^ 10 10 M 0 and W = 
300 km s" 1 where (M*) > 10 10 M 0 , smoothed to a spectral 
resolution of (W/2) km s _1 . This approach is outlined in 


Giovanelli et al. (20051 and is identical to GASS over the 
same stellar mass range (M* > 10 10 M 0 ). 


3.1 Errors 


Errors on the average gas fractions obtained with stacking 
are computed using the Delete-a-Group Jacknife (DAGJK) 


method (Kott 2001), a statistical estimate of the standard 


deviation (or rms error) that incorporates both variance of 
the estimator and its bias. Based upon the standard Jacknife 


procedure formulated by Tukey (1958), DAGJK iteratively 


estimates a given population parameter (i.e. Hi gas frac¬ 
tion) while discarding, in turn, a separate random subset of 
galaxies from the sample. The weighted difference between 
the mean statistic measurement and each jackknifed statis¬ 
tic is an estimate of that group’s influence on the mean value 
and an indicator of the variance within the dataset. 

The DAGJK standard deviation estimator is: 


a{t) = y/ var(t ) = 


\ 


(R-i) 

R 


R 


r =0 


( 3 ) 


where R is the number of replicated estimates, t( r ) is the es¬ 
timation of the population parameter without the rtli subset 
and t is the mean of the replicated estimates. 

When computing the DAGJK error we discard twenty 
per cent of the sample without replacement on each itera¬ 
tion, meaning that the number of estimates, R, is five. 

It should be noted that the errors shown are not indica¬ 
tive of the standard deviation of the underlying distribution 
of individual gas fractions, this is unknown. Instead the er¬ 
rors show the effect of strong outliers, if present, on the final 


stacking result. When employed using random rejection the 
main advantage of DAGJK over the traditional ‘leave one 
out’ method is the improvement in computational efficiency 
while maintaining precision in the (nearly) unbiased confi¬ 
dence interval of the population parameter. The result is 
entirely consistent with the standard Jackknife technique. 
Additional advantages of DAGJK will be explored in fol¬ 
low up investigations, where we will estimate errors on the 
stacked average gas fraction by targeted rejection according 
to galaxy properties (i.e. NUV-r colour). DAGJK will be 
capable of giving an indication as to how particular galaxy 
populations influence the mean gas content by providing 
some insight into the variance of the stacking average. 


4 H i GAS FRACTION SCALING RELATIONS 


In this section we present the main scaling relations of gas 
fraction versus stellar mass, stellar surface density and NUV- 
r colour for our sample, based upon spectral stacking of AL¬ 
FALFA data. Compared to previous work by |Fabello et al.] 
([2011), which was based upon a subset of our sample (~5000 
galaxies), we are able to extend our analysis down one order 
of magnitude in stellar mass (M* ^ 1O 9 M 0 ) as well as in¬ 
vestigate second order trends in gas fraction, gaining further 
insights into the physical drivers of these relations. 

Figure [3] shows stacked average HI fraction as a func¬ 
tion of galaxy stellar mass, stellar surface density and NUV-r 
colour, shown by the solid red lines. Errors on the average 
Hi fractions are computed using DAGJK, as described in 
Section |3.1| Grey points indicate individual HI detections 
from ALFALFA. In Figures |3a| and |3c| we also plot the aver¬ 


age linear gas fraction^ [[Cortese et al.|2011 1, provided by L. 


Cortese, from the Herschel Reference Survey (HRS, Boselli 
et al. 2010], shown by the magenta dashed line. We exclude 
Hi-deficient galaxies - typically found within clusters - be¬ 
cause of their significant offset to lower gas content. We do 
not show the HRS results in Figure |3b| because of a dif¬ 
ference in the definitions of stellar surface density between 
their work and ours. We confirm the trends of decreasing HI 
fraction as a function of galaxy stellar mass, stellar surface 
density and NUV-r colour, even with the addition of lower 
stellar mass galaxies. Our results are entirely consistent with 
the results of Fabello et al. ( 2011| ) as well as the findings of 
Catinella et al. (20131, using the log of the linear gas frac¬ 


tion averages from the final GASS data release (dashed black 
line). The data points for the full sample scaling relations 
shown in Figure [3] are given in Table |T| 

The blue line in Figure [3a] shows the result of stacking 
only galaxies that are detected by ALFALFA, corroborating 
previous findings that HI selected samples, unless corrected, 


° The reader should note that the distribution of gas fractions 
in the local Universe is more likely lognormal than Gaussian, so 
ideally one would compute (log Mhi/M.) ( [Cortese et al.|[201 1 ). 
However, the stacking method does not operate in log space, in¬ 
stead it returns the linear average of the HI content, so we must 
adopt log (Mhi/M*) in its place. Care must be taken when com¬ 
paring our results with gas fractions from deep, detection domi¬ 
nated surveys such as GASS and HRS that the average is taken 
of the linear data, not of the log-scaled value, as the log of the 
average is not the average of the log. 
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Figure 3. Average stacked Hi gas fractions are plotted as a function of galaxy stellar mass (M*), stellar surface density (/x*) and NUV-r 
colour for the whole sample. Grey points are individual detections included in the ALFALFA catalogue. The magenta dashed line is 
average gas fraction of nearby Hi detections from the Herschel Reference Survey (HRS; Boselli et al., 2010). Dashed black line shows the 
scaling relation from GASS. Numbers represent the total number of co-added galaxies within each bin. The blue line in Figure [3a] is the 
stacked average of the galaxies that are detected in ALFALFA. The average gas fraction data points for each relation are given in Table 
□ 


will overestimate the average gas content of galaxies within 
the volume (see Figure 2 in Huang et al.|2012l. 


To take this analysis a step further, we must establish 
the importance of each of the main parameters as a tracer of 
gas content. Stellar surface density can be taken as a rough 
proxy for bulge-to-total ratio, a good morphological indica¬ 
tor, with the fraction of disk-dominated systems decreasing 
as /i* increases (Kauffmann et ah||2006 1. Within our sam¬ 


ple, more massive galaxies tend to have higher stellar sur¬ 
face densities and thus earlier morphologies. As the ratio of 
young to old stars, NUV-r colour is used as a proxy for spe- 


Schiminovich et al. 

20071. 

and Catinella et al. 

(2013]) 


Salim et al.|2005| 

20117 

in Cortese et al. ( 

2011) 


tiglitly correlated with p* and NUV-r colour. However, no 
previous study has had the requisite amount of galaxies to 


divide the sample by two parameters simultaneously. The 
large statistics afforded by our sample allows us to fix a pri¬ 
mary parameter while binning the sample in terms of a sec¬ 
ond, disentangling the individual dominance of stellar mass, 
stellar surface density and NUV-r colour in governing, or at 
least tracing, the average gas fraction of these galaxies. 

Figure [4a] shows gas fraction versus stellar mass, fixing 
stellar mass along the x-axis and splitting the sample into 
bins of stellar surface density. We then invert this in Fig¬ 
ure |4b| so that stellar surface density is held constant and 
we are separating galaxies according to their stellar mass. 
Blue, green and red lines denote the average gas fractions 
obtained by stacking. For comparison, the scaling relations 
for the whole sample from Figure [3] are shown by the dashed 
black line. Figure [4a] shows a large difference (~0.8 dex) be¬ 
tween average HI fraction for disk-dominated (low p*) and 
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Figure 4. Left: Hi gas fraction as a function of stellar mass, separated into low (blue), intermediate (green) and high (red) stellar surface 
density bins, as indicated on the top right. Right: Hi gas fraction as a function of stellar surface density, separated into bins of stellar 
mass. In both panels, grey points indicate ALFALFA detections and the numbers below the relations indicate the number of galaxies 
co-added in each bin. The dashed black lines show the scaling relations for the whole sample from Figures |3a| and |3b| respectively. Refer 
to Table |2| for the average gas fractions from the scaling relations. 
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Figure 5. Left: Hi gas fraction as a function of stellar mass, separated into bins of NUV-r colour, as indicated on the top right. Right: 
HI gas fraction as a function of NUV-r colour; separated into bins of stellar mass. Symbols and numbers as in Figure [T] Upper limits 
obtained for the bins where the stacked spectrum is a non-detection are shown as upside-down triangles. The dashed black lines reproduce 
gas fraction scaling relations for the whole sample from Figures |3a| and |3c| respectively. The gas fraction values are given in Table [3] for 
the average gas fractions for each scaling relation. 


bulge-dominated (high p*) galaxies at a given stellar mass. 
In contrast, Figure 4b has a smaller variation (~0.4 dex) 
in gas content across the sample’s mass range while holding 
stellar surface density constant, suggesting that the aver¬ 
age Hi fraction of a particular morphological class is, to 
a small degree, sensitive to the mass of the system. Using 
targeted Hi observations, Catinella et al. (2013) found that 
the distributions of gas fractions averaged in the stellar mass 
and stellar surface density relations have a standard devia¬ 
tion (tm, = 0.5 dex and crj,, = 0.4 dex respectively. As the 
spread in gas fraction between our second parameter bins is 


generally comparable to or larger than these values, we can 
safely conclude that the differences in gas fraction between 
the solid lines in Figure [4] are significant. 

Lastly, we note that the average gas fraction in the low¬ 
est stellar mass (M* < 1O 9,75 M0) and highest surface den¬ 
sity bin in Figure |4b| is markedly above the relation. The 
higher than expected gas fraction is likely due to the uncer¬ 
tainty involved in calculating stellar surface densities for the 
large fraction of compact objects within the bin, over esti¬ 
mating the stellar surface densities. When statistics is low 
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Figure 6. Left: Gas fraction as a function of stellar surface density, separated into blue, green and red sequence galaxies according to 
their NUV-r colour (solid lines) as indicated on the top right. Right: Gas fraction as a function of NUV-r colour, separating into bins 
of stellar surface density. Symbols and numbers as in Figure [4] The full sample scaling relations from Figures |3b| a nd |3c| are plotted as 
black dashed lines. Triangles denote non-detections, set to their upper limits. For the data points, refer to Table [4] 


Table 1. Average gas fractions for the full sample scaling rela¬ 
tions shown in Figure[3] The column labelled x is the galaxy prop¬ 
erty along the x-axis, ( x) denotes the mean values of a: within each 
bin, (Mhi/M*) is the linear gas fraction and N gives the number 
of galaxies within each bin. 


(x) (Mhi/M*) 


N 


log M* 


log M* 


NUV-r 


9.21 

1.511 

± 

0.011 

5506 

9.64 

0.643 

± 

0.011 

6904 

10.14 

0.232 

± 

0.005 

6290 

10.62 

0.096 

± 

0.002 

4424 

11.07 

0.039 

± 

0.001 

1213 

7.43 

1.976 

± 

0.041 

2349 

7.86 

1.045 

± 

0.014 

6388 

8.30 

0.417 

± 

0.010 

5342 

8.75 

0.159 

± 

0.002 

6265 

9.20 

0.053 

± 

0.003 

3926 

1.98 

1.827 

± 

0.029 

4407 

2.75 

0.685 

± 

0.005 

8095 

3.74 

0.211 

± 

0.012 

4112 

4.81 

0.104 

± 

0.002 

3062 

5.89 

0.034 

± 

0.003 

4661 


these galaxies dominate the average gas fraction measure¬ 
ment and the point should not be considered reliable. 

Similarly, in Figure [5] we split the sample by NUV-r 
colour instead of stellar surface density. The blue, green and 
red lines denote galaxies within the NUV-r colour bins - cho¬ 
sen to approximately correspond to traditional blue cloud, 
green valley and red sequence classifications - and stellar 
mass bins - chosen to span the transition mass of 10 10 Mq 
(Kauffmann et al. 20031, where there is an observed shift 
in the abundance of late, star forming galaxies to earlier, 
quiescent systems. Stacked spectra that remain undetected 
(upside-down triangles) are found exclusively on the red se¬ 


quence with NUV-r > 4.5. Non-detections are set to their 
upper limits (see Section [3|. 

Figure [5a] shows that for a given stellar mass the gas 
fraction varies significantly (~1.0 dex) across the NUV-r 
colour range, while in Figure [5b| the difference between av¬ 
erage gas fraction at fixed NUV-r colour for low and high 
mass systems is significantly less (~0.5 dex). Even in this 
case, these differences are larger than the typical standard 
deviation of the scaling relation as directly measured from 
detections (e.g. aNuv-r = 0.3 dex, |Catinella et al.||2013| ). 
The implication is that, once the trend of high stellar mass 
bins being dominated by redder galaxies is removed, the av¬ 
erage HI fraction for galaxies of given SSFR is only weakly 
dependent on mass. 

When splitting the gas fraction-stellar mass scaling rela¬ 
tion in terms of either surface density or NUV-r we find that 
the slope of the linear fit to the relation flattens considerably 
(V,^ = —0.45±0.01; V nuv-t = — 0.35±0.02) with respect to 
the whole sample (Vf u ii = —0.85±0.01) shown by the dashed 
black line in Figures [4] and [5] This clearly shows that the 
steep slope of the gas fraction-stellar mass relation arises as 
a result of preferentially stacking blue, gas-rich galaxy pop¬ 
ulations in low stellar mass bins and red , gas-poor systems 
in high mass bins. 

Having confirmed that stellar surface density and NUV- 
r colour dominate over stellar mass as tracers of atomic gas 
content, the next step is to test which one of these is the 
principal parameter in the determination of average HI frac¬ 
tion. The best method to address this is presented in Figure 
[6] the main result of this work. We remove the stellar mass 
constraints and show how the gas content varies when we 
fix surface density or NUV-r colour while binning in terms 
of the other. As with Figures [4] and [5] the non-detections 
occur exclusively in bins of NUV-r > 4.5, where the stack¬ 
ing of large numbers of galaxies is required to reduce the 
rms noise sufficiently for the signal to be detected. Figure 
[6a] demonstrates that galaxies at a given stellar surface den- 
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sity exhibit a spread (~1.0 dex) in Hi content across the 
NUV-r colour range (solid lines) from blue cloud to red se¬ 
quence. The difference of 1.0 dex is statistically significant 
when compared to the scatter in the gas fraction-stellar sur¬ 


face density relation found by Catinella et al. (2013), = 


0.4 dex. In contrast, we fix the colour in Figure |6b| while 
separating galaxies according to low, intermediate or high 
stellar surface density (solid lines). In this case the difference 
in gas fraction between the highest and lowest /.t* bins, i.e. 
the bulge- and disk-dominated systems, decreases to ~0.5 
dex on average. 

This implies that the most important quantity in trac¬ 
ing of neutral atomic hydrogen content is NUV-r colour. 
Making additional cuts in surface density, already a mass 
dependent quantity, does not significantly alter the values 
of gas fraction. In other words, galaxies of a similar colour 
are, on average, likely to contain similar ratios of cold gas to 
stellar mass, showing only a small dependence on size (e.g. 
mass) or morphology (e.g. surface density). 

While the strong relation between gas and NUV-r 
colour is evident, Figures |5b| and |6b| do demonstrate a 
smaller, residual dependence of gas fraction on stellar mass 
and density at fixed NUV-r colour. This result raises in¬ 
teresting questions surrounding the impact of stellar mass 
and surface density on gas consumption in addition to star 
formation, which will be discussed in Section [5] 

In addition, the slope of the gas fraction-surface density 
relation is flattened when binning galaxies by NUV-r colour. 
The linear fit to the full sample relation has a gradient of 
Vfuii = —0.89 ± 0.04, while selection by NUV-r reduces the 
slope to V NUV-r = —0.33±0.06. This reaffirms that selection 
by only surface density yields a mixed population of galaxies 
and shows that the surface density scaling relation is driven 
by the underlying correlation of gas content with NUV-r 
colour. 

To confirm the low affinity of gas fraction with stellar 
mass, we also split the scaling relations of Figure[6]into lower 
(M* < 10 10 Mq) and higher stellar mass (M* ^ 10 10 M 0 ) 
bins (not shown). The difference between the relations of 
Figure [6] for low and high mass galaxies was not signifi¬ 
cant, ~0.1 dex (a) and ~0.2 dex (b). This reaffirms that, 
within our sample, the variation of gas content across the 
stellar mass range is small, once morphology and/or SSFR 
of galaxies has been fixed. 

Finally, we stress that by looking at the effect of sec¬ 
ondary parameters on the HI scaling relations we have been 
able to put important constraints on the underlying gas frac¬ 
tion distribution of the stacked population. In other words, 
we have been able to overcome one of the main limitations 
of HI stacking, thus enhancing the scientific potential of this 
technique. 


5 DISCUSSION AND CONCLUSIONS 

In this work we have applied HI spectral stacking to a large 
sample of 24, 337 galaxies. Each galaxy is selected according 
to redshift and stellar mass from the SDSS DR7, with HI 
spectra and NUV data from the ALFALFA blind H i-survey 
and GALEX catalogues respectively. The goal of this study 
is to investigate the dependence of gas content on the galaxy 
properties of stellar mass, stellar surface density and NUV-r 


colour, extending the previous work of Fabcllo et al. (2011) 
down to lower stellar masses (M* ^ KUMq) and signifi¬ 
cantly increasing the number of galaxies. 

The key conclusions of this paper can be can be sum¬ 
marised as follows: 

1. We confirm that NUV-r colour excels over stellar mass 
and stellar surface density as a tracer of galaxy gas content, 


as previously noted by Cortese et al. (20111, Fabello et al. 
pOll ) and |Catinella et al.| ( |2013| ). Additionally, we quantify 
the strong decrease of gas fraction with increasing NUV-r 
colour at fixed stellar mass and stellar surface density. 

2. We show for the first time that the gas fraction-stellar 
mass and, to first order, the gas fraction-surface density scal¬ 
ing relations are driven by the primary correlation of gas 
content with NUV-r colour. 

3. At fixed NUV-r colour we find a small residual de¬ 
pendence on stellar mass and stellar surface density. This 
suggests a residual effect of mass and morphology on gas 
consumption at fixed SSFR, as discussed below. 

As already mentioned, one may regard NUV-r colour 
as a proxy for SSFR and thus, under the simple assump¬ 
tion that gas fraction and colour are derived over the same 
surface area, it is easy to show that the gas fraction-NUV-r 
relation can be interpreted as an integrated Hi Kennicutt- 
Schmidt (KS) law ( Schmidt||1959 Kennicutt|1998 1 relating 
the atomic gas content of galaxies to their star formation 
activity. In this context, our findings not only confirm that 
star formation is the property of galaxies most closely re¬ 
lated to their HI gas content, but also highlight that the 
main scaling relations of gas fraction-stellar mass, -stellar 
surface density and -NUV-r colour can be understood as a 
combination of the underlying bimodality in specific star for¬ 
mation and the KS relation. Low mass or disk galaxies are 
preferentially blue, star formers, whereas massive or bulge- 
dominated systems are, on average, more quiescent. This 
means that the gas fraction-stellar mass and -stellar surface 
density relations are simply driven by the variation in SSFR 
between the two populations. 

Given that NUV-r colour is the principle driver be¬ 
hind the main integrated gas fraction scaling relations, it 
is interesting that we find a residual dependence on stel¬ 
lar mass and surface density in the gas-fraction-NUV-r 


plane (see Figures 5b and 6b I. High mass and surface den¬ 


sity galaxies have lower Hi fraction than low surface den¬ 
sity objects at fixed NUV-r colour. If, as above, we take 
NUV-r as equivalent to SSFR, this shows that massive and 
bulge-dominated galaxies have a lower gas fraction than 
low mass and disk-dominated systems respectively for the 
same SSFR. It follows from this that the timescales over 
which gas is consumed by star formation (hereafter deple¬ 
tion time, Mhi/SFR) are shorter and therefore star forma¬ 
tion efficiency (SFE = SFR/Mhi) is higher in more massive 
or bulge-dominated galaxies than their low mass or disk- 
dominated counterparts, assuming that star formation con¬ 
tinues at its current rate. Our evidence contributes to a phys¬ 
ical picture where the gas content of galaxies is strongly 
regulated by star formation and, albeit to a lesser extent, 
influenced by both mass and structure. This is tentative ev¬ 
idence that the KS relation is to some degree dependent 
upon the morphological properties of the galaxy. 

Assuming that our results might extend to the molec- 
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Figure 7. Diagram outlining the dependence of SSFR on stellar 
mass. The scatter in the SSFR-M* relation is represented by the 
blue ellipse and the grey-blue dashed line denotes the star forming 
‘main sequence’. Galaxies classified as quiescent reside below the 
main sequence while star forming galaxies are found above. The 
pink dotted line shows a constant value of SSFR; the galaxies 
lying ‘below’ the main sequence, and thus deemed quiescent, are 
low mass systems while the high mass galaxies deviate ‘above’ 
the main sequence in their role as star formers. 


the mean relation. The increasing prominence of bulges in 
high mass galaxies leads to bulge-dominated galaxies being 
deemed stronger star formers in comparison with disks at 
fixed SSFR. Accepting this, Figure [ 7 ] illustrates how bulge- 
dominated galaxies lie above the specific SF main sequence 
and thus must have lower depletion time in comparison to 
disks at fixed SSFR, a result that is in agreement with the 


works of both Huang & Kauffmann (2014 1 and Saintonge 


et al. (2012). 


Of course it is important to offer more conclusive argu¬ 
ments on the relationship between gas and star formation in 
galaxies and to do so we must determine the physical star 
formation rates and efficiencies, rather than employing the 
proxy of NUV-r colour. This will be part follow-up papers. 

Our work introduces a more complete description of 
the relationships between galaxy properties and gas content, 
an area that provides strong constraints for galaxy forma¬ 
tion and evolution models by disentangling the influence of 
these properties on gas content. We demonstrate the impor¬ 
tance and power of HI spectral stacking, a technique with 
great potential for investigating the physical mechanisms 
that drive the evolution of galaxies by probing the cold gas 
content of huge galaxy samples in regimes that would oth¬ 
erwise be inaccessible until the full capability of the Square 
Kilometre Array is realised. 


ular gas (H 2 ) component, the secondary dependence of gas 
content on stellar mass and surface density is in qualita¬ 
tive agreement with Huang & Kauffmann (20141 who ex¬ 
amined the variation in H 2 depletion times in bulges, spi¬ 
rals, bars and rings, at fixed SSFR, for a sample of massive 
(M* > 10 10 M@) local galaxies. Their results show that, at 
fixed SSFR, the Ho depletion times are shorter for bulge- 
dominated galaxies. In their discussion they invoke the con¬ 
clusions of Heifer & Blitz (1993) as a possible explanation, 
whereby gravitational potential and the density of molecular 
clouds is increased in the presence of a stellar bulge. 

Conversely, Saintonge et al. (2012) find that for their 
sample of galaxies, a subset of which is analysed in [Huang] 


& Kauffmann (2014), the shortest depletion times and thus 


the highest SFEs are found in disk-dominated galaxies. This 
conclusion is reached by examining SFE as a function of dis¬ 
tance from the star formation (SF) main sequence, showing 
that weakly star forming galaxies have low gas content and 
long depletion times, and combining this with the derived 
H 2 KS law where high surface density systems lie systemat¬ 
ically below the mean relation. 

At face value, our result showing variation in gas con¬ 
tent with surface density seems contradictory to the con¬ 
clusion of Saintonge et al. (2012). However, it is easy to 
show that the inconsistency is only apparent. Both [Huang] 


& Kauffmann (20141 and this work examine depletion time 


at fixed SSFR while, on the other hand, [Saintonge et al.| 
( |2012[ ) leave SSFR unconstrained and investigate depletion 
time as a function of distance from the SF main sequence. 
Once one accounts for the difference in method our find¬ 
ings are entirely consistent with those of |Saintonge et al.] 
(20121. To illustrate this, the cartoon in Figure [TJshows that 
for fixed SSFR, high mass galaxies are clearly offset above 
the specific SF main sequence (see [Salim et al. 2007) while 
low mass galaxies are found in the quiescent region below 
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Table 2. Average gas fractions for the scaling relations shown in 
Figure [4] The column labelled y is the secondary property and 
limits by which we bin the sample. 

Table 3. Average gas fractions for the scaling relations shown 
Figure [5] Numbers preceded by a “<” sign are upper limits. 

in 

X 

V 

(x) 

(M ffl /M*> 

N 

X 

V 


(x) 

(M ffl /M*> 

N 

log M+ 

7 ^ log /i* < 8 

9.20 

1.722 

± 

0.029 

4204 

log M* 1 

^ NUV-r < 

3 

9.20 

1.805 ± 0.017 

4464 



9.59 

1.000 

± 

0.017 

2808 




9.62 

0.953 ± 0.021 

3988 



10.02 

0.589 

± 

0.018 

363 




10.10 

0.510 ± 0.008 

1684 











10.57 

0.318 ± 0.012 

429 











11.04 

0.186 ± 0.017 

39 


8 ^ log /x* < 8.8 

9.24 

0.735 

± 

0.030 

1226 

3 

s: NUV-r < 

5 

9.23 

0.355 ± 0.040 

898 



9.67 

0.429 

± 

0.008 

3733 




9.66 

0.286 ± 0.013 

2201 



10.13 

0.275 

± 

0.004 

3888 




10.15 

0.186 ± 0.003 

2763 



10.58 

0.174 

± 

0.002 

1488 




10.62 

0.127 ± 0.003 

1883 



11.03 

0.114 

± 

0.008 

119 




11.05 

0.079 ± 0.007 

364 


8.8 ^ log fii, ^ 10 

9.75 

0.168 

± 

0.029 

360 

5 

s: NUV-r s: 

8 

9.24 

< 0.258 

144 



10.19 

0.095 

± 

0.008 

2038 




9.70 

0.071 ± 0.016 

715 



10.64 

0.056 

± 

0.001 

2924 




10.17 

0.057 ± 0.004 

1843 



11.08 

0.032 

± 

0.002 

1092 




10.64 

0.026 ± 0.001 

2112 











11.09 

0.015 ± 0.001 

810 


log 

9 sc log M+ < 9.75 

7.43 

1.990 

± 

0.028 

2328 

NUV-r 9 ^ log M* < 9.75 

1.97 

1.979 

± 

0.028 

3819 





7.84 

1.151 

± 

0.010 

5070 




2.68 

0.910 

± 

0.033 

4480 





8.27 

0.538 

± 

0.016 

2222 




3.70 

0.286 

± 

0.029 

1119 





8.67 

0.288 

± 

0.016 

740 




4.80 

0.192 

± 

0.051 

707 





9.13 

0.476 

± 

0.120 

44 




5.63 

< 

0.123 

339 


9.75 

^ log M* 

< 10.5 

7.53 

0.993 

± 

0.088 

21 

9.75 

^ log M* 

< 10.5 

2.04 

0.865 

± 

0.023 

566 





7.95 

0.622 

± 

0.020 

1314 




2.81 

0.429 

± 

0.005 

3067 





8.31 

0.328 

± 

0.007 

2868 




3.74 

0.196 

± 

0.010 

2091 





8.74 

0.159 

± 

0.009 

3638 




4.82 

0.106 

± 

0.004 

1506 





9.18 

0.068 

± 

0.007 

1522 




5.82 

0.048 

± 

0.004 

2135 


10.5 

^ log M* 

s: 11.5 

8.38 

0.238 

± 

0.012 

252 

10.5 

^ log M* 

^ 11.5 

2.00 

0.304 

± 

0.041 

22 





8.80 

0.098 

± 

0.002 

1887 




2.95 

0.234 

± 

0.003 

548 





9.21 

0.037 

± 

0.002 

2360 




3.78 

0.120 

± 

0.005 

902 













4.80 

0.053 

± 

0.003 

849 













6.00 

0.021 

± 

0.001 

2187 
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Table 4. Average gas fractions for the scaling relations shown in 
Figure |6| Upper limits are preceded by a “<” sign. 


X 


y 


{x) 

(Mhi/M.) 

N 

log £ 1 * 

1 

< NUV-r < 

3 

7.43 

2.069 ± 0.026 

2223 





7.85 

1.238 ± 0.035 

5023 





8.27 

0.649 ± 0.012 

2417 





8.67 

0.435 ± 0.006 

810 





9.21 

0.358 ± 0.072 

70 


3 

<r NUV-r < 

5 

7.48 

0.346 ± 0.038 

122 





7.91 

0.366 ± 0.021 

1279 





8.31 

0.244 ± 0.020 

2445 





8.74 

0.162 ± 0.004 

3318 





9.16 

0.106 ± 0.005 

941 


5 

«: NUV-r s: 

8 

7.95 

< 0.318 

86 





8.35 

< 0.089 

480 





8.80 

0.050 ± 0.004 

2137 





9.21 

0.028 ± 0.001 

2915 


NUV-r 7 ^ log /i* < 

8 

1.97 

2.040 ± 0.037 

3353 



2.65 

0.969 ± 0.021 

3437 



3.64 

0.394 ± 0.049 

452 



4.71 

< 0.294 

115 



5.69 

< 0.512 

27 

8 < log < 

8.8 

2.02 

1.114 ± 0.041 

954 



2.81 

0.489 ± 0.009 

4304 



3.72 

0.203 ± 0.009 

2649 



4.78 

0.140 ± 0.018 

1506 



5.75 

0.048 ± 0.017 

1041 

8.8 < log ^ 

10 

1.91 

0.562 ± 0.110 

53 



2.95 

0.244 ± 0.013 

339 



3.83 

0.146 ± 0.009 

1008 



4.85 

0.071 ± 0.007 

1441 



5.93 

0.030 ± 0.002 

3591 





